Heterogeneous Technology Configurable Fabrics: A Field- Programmable Paradigm for Leveraging Post-CMOS Devices in HPC

نویسنده

  • Ronald F. DeMara
چکیده

Discussions are advocated regarding the merits of Heterogeneous Technology Configurable Fabrics (HTCFs) to realize the needed architectural abstraction as a pathway to manage Extreme Heterogeneity (EH) facing High Performance Computing (HPC) in the post-Moore era. HTCFs are reconfigurable logic arrays that leverage the cooperative roles of emerging spintronic and CMOS devices within a runtime adaptable platform residing as compute-node accelerators within post-exascale computing centers. HTCFs advance the challenges of EH by providing a uniform and familiar programming interface to multiple emerging device technologies. HTCFs leverage and extend the robust experience base and versatility of reconfigurable computing technology, theory, and tools to reduce barriers towards hardware implementation and mass adoption of post-CMOS devices in HPC. Like CMOS-based FPGAs, they assist minimizing communication occurrence and cost while reducing software and hardware bloat. Finally, it will be advocated that HTCFs also enable the new advance of “technology co-design” which trades off alternative mappings between emerging devices and CMOS technology during circuit synthesis-time for placeand-route optimizations, and also during runtime by remapping hardware resource locality to vital features such as non-volatility. Keywords— Hardware Abstraction; Reconfigurable Computing Fabrics; Energy-Aware High Performance Computing; Dynamic Resource Optimization; Spintronics; Hardware Co-Design at Runtime. I. NEW PERSPECTIVE TOWARDS EMBRACING HETEROGENEITY It is advocated to debate the merits and implications of the vignette below as a transformative vision for post-Moore era HPC field-programmable accelerators. The perspective is to leverage both CMOS and post-CMOS devices for their relative strengths, such as fast switching speed, non-volatility, intrinsic addition/multiplication, and thresholding/neuromorphic computation which form a continuum of computational characteristics valuable to scientific computing. In particular, it is envisioned for Heterogeneous Technology Configurable Fabrics (HTCFs) to be available to the programmer and operating system as follows: Dateline: Advanced Scientific Computing Research (ASCR) site, USA; March 1, 2040. A scientific programmer seeks to solve incompressible Navier-Stokes equations facing challenging resolution and accuracy requirements. She considers the processing phases involved and functional modules of the algorithms used along with several hardware attributes. These include a TFLOPS budget, data retention time / non-volatility, and a static vs. dynamic energy profile of the primary modules. She proceeds to map her design onto a palette of computational resources that include some selected spinbased devices within a high gate-equivalent capacity HTCFcomponent, as depicted in Figure 1. This HTCF component is functionally-identical to a Field-Programmable Gate Array (FPGA) based accelerator in use today, except it allows field programmable access and mapping to beyond-CMOS devices. This mapping occurs “in the field,” i.e. in the computational center, without fabrication of custom hardware in the foundry, similar field-programmable access to CMOS-based hardware. She proceeds to enter source code which, as is already done today with FPGA-based translators via conversion to a Verilog functional description and configuration of IP blocks in a familiar FPGA-like specification, synthesis, and place-androute toolchain interface. Technology-aware design optimizers then examine a set of functionally-equivalent, but technologydistinct design mappings realized as a field-programmable interconnection within a heterogeneous fabric. This overcomes two near-term hurdles: (1) CMOS-only designs face challenges Figure 1: HTCF component vision. Physical devices consist of spintronic and CMOS elements within each HTCF Configurable Logic Block (H-CLB). H-CLBs comprise a larger field-programmable fabric of Functional Blocks (FBs) and Switch Blocks (SBs). DWM NML MTJ/SHE chargebased devices spinbased devices

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Scaling Floating-Gate Devices Predicting Behavior for Programmable and Configurable Circuits and Systems

Abstract: This paper presents scaling of Floating-Gate (FG) devices, and the resulting implication to large-scale Field Programmable Analog Arrays (FPAA) systems. The properties of FG circuits and systems in one technology (e.g., 350 nm CMOS) are experimentally shown to roughly translate to FG circuits in scaled down processes in a way predictable through MOSFET physics concepts. Scaling FG dev...

متن کامل

BISM: Built-in Self Map for Crossbar Nano-Architectures

Bottom-up self-assembly process used in the fabrication of nanoscale devices yields significantly more defects compared to conventional top-down lithography used in CMOS fabrication. Therefore, applying defect tolerant design schemes is inevitable to be able to map the design to these programmable fabrics around defects. However, defect-aware design mapping could be very time consuming and comp...

متن کامل

16nm BULK CMOS DOCCCII BASED CONFIGURABLE ANALOG BLOCK DESIGN FOR FIELD PROGRAMMABLE ANALOG ARRAY

Field programmable analog array (FPAA) is a rapidly growing technology for fast prototyping of analog signal processing systems facilitating the quick innovation at industry level with saving in terms of both time and cost. Analog signal processing is much faster and consumes less power than conventional digital signal processing, justifying the need for FPAA. Here we propose a Dual output curr...

متن کامل

Rapid Implementation and Optimisation of DSP Systems on SoPC Heterogeneous Platforms

The emergence of programmable logic devices as processing platforms for digital signal processing applications poses challenges concerning rapid implementation and high level optimization of algorithms on these platforms. This paper describes Abhainn, a rapid implementation methodology and toolsuite for translating an algorithmic expression of the system to a working implementation on a heterog...

متن کامل

Regular Logic Fabrics for Via Patterned Gate Arrays

As transistor feature sizes scale below 100nm, the costs of standard-cell-based application specific integrated circuits (ASICs) are increasing so rapidly that fewer products have sufficient volume to justify the high non-recurring engineering (NRE) costs. Consequently, more designs are relying on fully programmable devices such as field programmable gate arrays (FPGAs). Unfortunately, such ful...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017